CDS

Accession Number TCMCG075C03103
gbkey CDS
Protein Id XP_017969856.1
Location complement(join(30505686..30505779,30505907..30506097,30506337..30506959,30507110..30507281,30507391..30507466,30507701..30507820,30507905..30508069,30508403..30508493,30508728..30508812,30509185..30509343,30509558..30509614,30509711..30509806,30509895..30509951))
Gene LOC18613466
GeneID 18613466
Organism Theobroma cacao
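
The Location field above lists 13 exon intervals on the reverse strand. The following is a minimal sketch, assuming 1-based inclusive coordinates as in GenBank-style feature locations, that parses the string and checks that the summed exon length agrees with the 661 aa protein length reported below. The names `parse_location`, `exons`, and `cds_length` are hypothetical and are not part of the record.

```python
import re

# Location string copied from the record above.
location = (
    "complement(join(30505686..30505779,30505907..30506097,"
    "30506337..30506959,30507110..30507281,30507391..30507466,"
    "30507701..30507820,30507905..30508069,30508403..30508493,"
    "30508728..30508812,30509185..30509343,30509558..30509614,"
    "30509711..30509806,30509895..30509951))"
)


def parse_location(loc):
    """Return (strand, exons) for a simple join/complement location string.

    Assumption: only plain 'start..end' intervals occur, with no fuzzy
    boundaries such as '<' or '>'.
    """
    strand = "-" if loc.startswith("complement(") else "+"
    exons = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", loc)]
    return strand, exons


strand, exons = parse_location(location)

# Coordinates are inclusive, so each exon spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in exons)
codons, remainder = divmod(cds_length, 3)

print("strand:", strand)
print("exons:", len(exons))
print("CDS length (nt):", cds_length)
print("residues excluding stop codon:", codons - 1)
```

Under these assumptions the exons sum to 1986 nt, that is 662 codons: 661 residues plus the stop codon.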

Protein

Length 661aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018114367.1
Definition PREDICTED: auxin response factor 23 isoform X2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category K
Description Auxin response factors (ARFs) are transcription factors that bind specifically to the DNA sequence 5'-TGTCTC-3' found in auxin-responsive promoter elements (AuxREs).
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001
KEGG_ko ko:K14486
EC -
KEGG_Pathway ko04075, map04075
GOs -

Sequence

CDS:  
ATGGCTTCCTTTCCAACTGGTTTTTCGGTTTTCCTTCTTGGGCTTTCCTTTAAAGAGGTCGAGGCATATATGAATAAAGATGGGACAATGGAAATGCCCATCTATAATTTACCTTCTAAGATCCTCTGCAGGGTGGTGCACGTCCAGCTTAAGGCTGAAACTGGCACAGATGAGGTCTTTGCACAAATTACTCTACTTCCAGAGGCAGAGCAAGATGAGCTAAGTATGGAGCATAGAAATTATCAAGCCTTACCTCGAGAAGCTCATCCACGGATCTTGAGCAAGAAACTCACTCCATCTGACACAAACACACATGGTGGATTCTCTGTTCCAAAGCGACATGCTGATGATGGGTGTCTTCCGCCACTGGACATGTCTCAGCATACCCCACAGCAGGATTTGGTCGCGATTGACTTGCATGGTTCTGAATGGCGGTTTCGTCATATTTTTCGTGGTCAGCCAAAAAGGCACTTGCTTACCAGTGGTTGGAGTACATTCGTGACATCAAAGAAGCTTGTTGCTGGGGATACATTTATCTTTCTTAGAGGGGATAATGGAGAGCTTCATGTTGGCGTTCACCGAGCAACAACACTAATGAACAATACATCAACATCTGTGATATCTGGTCACAGCATGAGACATGGTATACTTGCTAGTGCTTTCCATGCCTTTTCTACCAGAAGCATGTTTACTATCTACTACCGCCCTTGGACAAGTTCTTCTGAATTCATCATCCCACTTGATCAGTATATGAAGTCCGCTGAAATTGTTTACTCCATTGGGACAAGATTTAGGATGCAATCTGAAGGCAAAGAATGTGGGGAACAAAGAGCTCTTGGCACTATCATTGGCACTGAAGATGTTGATCACATTAGGTGGCCAAATTCTGAATGGAGATGTCTGAAGGTGAAATTGGATCCCACATCAGATGCAAATTTTCGCCCTGAAAGAGTCTGTCCTTGGAACATTGAACCAATAGAATCCACTAACAGAAAGAAACCTTTCATTTTGCGTCAGCAAAAGAGGGCTCGTACTGATGATGCATCATCCCCTGGGTTTTCTAGCTTGCTTATGGATGGCATGTGGTGTGGCTCAGTTAAATATGAATCTCAAAGTAGCTCAGGGGTCTTGCAAGGTCAAGAAGATGACACAGATGTGAATCAATCCAGTGCGCTAAGACAACCATTGCCACATTTGGTTCTCCCACTACATCCTGATTGTGCCTCAATGCAACCGCAGATGGAGAATCAACTAGAGATTCAGGTTCCGATCTGCAACTCATTTTATCAATGTACCAGCAGCAGAGCACTTTATTCTGGTGGCAAAGTAGCTTGTTTGGGTCTTCATAATAACTGGTCTCCAACATTCTCCTCTTATGGAGTTGATGACGATGCTCTTGCTAGGAGAAAATTTTCAGTTCCATATGTCAATTCTCAGGAATCGAGAACTTTGGAACTAAGGAATGAAAATGAAACTTCACTTTGTGAACCGACCGGTGGTCACAGATGCATGATTTTTGGAGTAAATTTAGTTAATGGTCCACCGGAGCTCCCTTCACGACAAGTTCTCACTTCTAGTGAGCTTAAACGTCTTTGTTCTATTCCTCCAACGTCTCAGTCAAGTGTTTCAGAACCTTCTAAGGTTACATCTAGCAAGCAGTGCAACAACAGTTGCTCTGTCAGCAACCGGAGTTGCACCAAGGTGCTCAAGTATGGGACTGCACTTGGAAGATCAGTTGATCTCACTCGATTTAATGGATATGAAAACCTCATCAGTGAGCTTGATCGAATGTTTGATTTTAAAGGAAGATTGATCAATGGAAGCAGTGGCTGGCATGTAACTTATACTGATGATGAGGGGGACATGATGCTTCTTGGAGATTACCCATGGCAGAAATTTCAGTACGAGGTCCGAAGGATTGTCATCTGCCCAATGGAAGAAATTGACAGACTGAATCAAAGCTCACCAAATTCAACATCTCAATGA
Protein:  
MASFPTGFSVFLLGLSFKEVEAYMNKDGTMEMPIYNLPSKILCRVVHVQLKAETGTDEVFAQITLLPEAEQDELSMEHRNYQALPREAHPRILSKKLTPSDTNTHGGFSVPKRHADDGCLPPLDMSQHTPQQDLVAIDLHGSEWRFRHIFRGQPKRHLLTSGWSTFVTSKKLVAGDTFIFLRGDNGELHVGVHRATTLMNNTSTSVISGHSMRHGILASAFHAFSTRSMFTIYYRPWTSSSEFIIPLDQYMKSAEIVYSIGTRFRMQSEGKECGEQRALGTIIGTEDVDHIRWPNSEWRCLKVKLDPTSDANFRPERVCPWNIEPIESTNRKKPFILRQQKRARTDDASSPGFSSLLMDGMWCGSVKYESQSSSGVLQGQEDDTDVNQSSALRQPLPHLVLPLHPDCASMQPQMENQLEIQVPICNSFYQCTSSRALYSGGKVACLGLHNNWSPTFSSYGVDDDALARRKFSVPYVNSQESRTLELRNENETSLCEPTGGHRCMIFGVNLVNGPPELPSRQVLTSSELKRLCSIPPTSQSSVSEPSKVTSSKQCNNSCSVSNRSCTKVLKYGTALGRSVDLTRFNGYENLISELDRMFDFKGRLINGSSGWHVTYTDDEGDMMLLGDYPWQKFQYEVRRIVICPMEEIDRLNQSSPNSTSQ
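
The relationship between the CDS and the protein sequence above can be sketched as a codon-by-codon translation. This is an illustrative sketch, assuming the standard genetic code (NCBI translation table 1), and not the pipeline that produced the record. `CODON_TABLE` and `translate` are hypothetical names, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and last 12 nt of the CDS shown above.
cds_start = "ATGGCTTCCTTTCCAACTGGT"
cds_end = "ACATCTCAATGA"

print(translate(cds_start))  # MASFPTG, the first seven residues
print(translate(cds_end))    # TSQ, the last three residues before the TGA stop
```

Passing the full CDS string to `translate` should give the complete protein sequence if the record is internally consistent.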